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Abstract. By proving a local limit theorem for higher-order transitions, we 
determine the time required for necklace chains to be close to stationarity. 
Because necklace chains, built by arranging identical smaller chains around a 
directed cycle, are not reversible, have little symmetry, do not have uniform 
stationary distributions, and can be nearly periodic, prior general bounds on 
rates of convergence of Markov chains either do not apply or give poor bounds. 
Necklace chains can serve as test cases for future techniques for bounding rates 
of convergence. 
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1. Introduction 

Determining the rate of convergence to stationarity of finite ergodic Markov 
chains is a problem of both theoretical and practical interest, with applications to 
sampling and estimation problems. It is also a difficult problem. Over the last 20 
years, many techniques have been developed for bounding convergence behavior; 
see Aldous and Fill [Q, Behrcnds or Lovasz for surveys. Coupling, strong 
stationary time, and finite Fourier analysis arguments all exploit chain symmetry. 
Second-largest eigenvalue techniques and inequalities inspired by differential geom- 
etry generally require reversibility; symmetrized versions of those bounds, due to 
Diaconis and Saloff-Coste Q and Fill ||], can be applied to non-reversible chains, 
but only those with strong types of aperiodicity. 

Despite this plethora of methods, the time required to be close to stationary has 
been determined precisely — with, say, a correct leading term constant — only for 
certain families of chains (Diaconis Q gives a survey focussed on chains whose time 
to be near stationarity displays sharp cutoffs). The current work adds the family of 
necklace chains to the list. Necklace chains have little symmetry, have non-uniform 
stationary distributions, are not reversible, and can be nearly periodic (due to 
deterministic transitions). Thus existing general bounds on rate of convergence are 
difficult to apply to necklaces. We hope our results will allow necklace chains to 
serve as test cases for techniques for bounding rates of convergence, and we provide 
an example of this utility. 

Necklaces are built from smaller chains. A bead B is a Markov chain with states 
{0, 1, . . . , b} such that 

• The only absorbing state is b, and every state lies on some path from to b. 

• The set of possible first passage times from to 6 has minimal span 1. 



Date: February 1, 2008. 

Research supported by the AT&T Bell Laboratories Graduate Research Program for Women 
and the AT&T Laboratories Fellowship Program. 



2 



ELIZABETH L. WILMER 



(a) 




Figure 1. The underlying graphs of (a) a bead B, (b) its closure 
B, and (c) a necklace chain built with B. Here, n — 9, to — 5, and 
r = (1, 0, 1, 0, 1, 1, 1, 0, 0). Entrance, exit, and link states are black. 



State is the entrance state of B and state b is the exit state. The closure B of the 
bead B has the same transitions as B, except that B(b 7 0) — 1. See Figure |l|(a,b). 

To construct P r , a necklace built with bead B and indicator vector r — (ro, . . . , r n _i), 
start with link states so, si, . . . , s n ~i- 

• If ri = 1, there is a bead at position i: attach Si and s^+i via a bead isomorphic 
to B with states s^o = Si, s^i, . . . , a^j = Sj+i. The states in the bead at 
position i are s^o = Sj, i, . . . , s^b-i- (We exclude the link state s^b = s.; + i 
from the i-th bead.) 

• If r, = 0, i/iere is no£ a 6ead a£ position i: attach Sj and Sj+i with a directed 

edge, and set P r (s,-, s,- + 1) = 1. 

Indices are taken mod n, so that r n — l determines how s n -i is connected to so- See 
Fi gure |l|(c). Let Ri — X^fc=o r fe an< ^ ^ TO — ^» be the total number of beads. The 
Markov chain P r is always irreducible; it is aperiodic as long as to > 1. 

In a family of necklace chains {P n } = {Pn '■ n G N}, P n has n link states, has 
to = m(n) beads (1 < m < n), and has indicator vector r = r(n). We generally 
suppress the dependence on n of these and other chain parameters. 

Our main results are Theorem [l], which approximates the higher-order tran- 
sitions of families of necklaces, and Theorem B which describes the asymptotics 
of their distance from stationarity. When rescaled appropriately, families of neck- 
laces behave like random walk on the cycle Z/rtZ. They converge gradually to 
stationarity and have no cutoffs. The time to be near stationarity depends only 
on the number of beads, not on their arrangement, and can range from O (\S\ 2 ) to 
0(\S\ 3 ), where |S| is the size of the state space. 
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Let <f>(x) — — h=e x I 2 be the standard normal density and let 



be the density at time c of Brownian motion on the circle R/Z of unit circumference. 
We measure distance between distributions in total variation: 

IK) - t'OIItv = sup HA) - n'(A)\ =IJ2 ~ ^Wl" 



Theorem 1. Let B be a bead with first passage time from entrance to exit of mean 
\i and variance a 2 . Let {P n } be a family of necklace chains built with bead B. Let 
7r n be the stationary distribution of P n . Fix c > and let 

, . c(n + (u — l)m) 3 _ . . 

t = t(n) = - —„ + O (n) . 

am 

For each n, let s be either sq or a state in a bead at position n — 1 , and let s' be 
either Si or a state in a bead at position i. Then as n — * oo, 

\ n+(/x — l)m J \n J 
Theorem 2. Under the assumptions of Theorem as n oo, 
\\p^ s ,.)-n n (-)\\ TV ^ \ f \e c (x)-l\dx. 



Section 2 collects for reference some standard facts and computes the stationary 
distributions and the higher-order transitions of necklace chains. In Section 3, 
we prove Theorems |l| and p[ The argument applies a local central limit theorem 
due to Petrov, a classical large deviation bound, and Markov chain identities to 
the combinatorial expressions for the higher-order transitions. Section 4 examines 
several families of necklace chains, including one (with well-behaved multiplicative 
reversibilizations) for which a Nash inequality of Diaconis and Saloff-Coste gives 
the correct rate of convergence, up to a constant factor. 

2. Preliminaries 

First we state (for future reference) the specializations of two standard results 
on Markov chains to B. See, for example, Aldous and Fill Q. 

Proposition 1. Let B be a bead, and let X be a random variable distributed as the 
first-passage time from to b in B (or, equivalently, B). Then there exist a positive 
integer no and a constant < a < 1 such that n > no implies Pr(X > ri) < a™. 

Proposition 2. Let B be a bead, let tt be the stationary distribution of its closure 
B, and let fi be the expected first passage time from to b in B. Then, for any 
state k ^ b of B, 



5^ J B°(0,*)=7r(fc)(A* + l). 
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Proof. This sum is the expected number of visits to state k when the chain B is 
started in and run until hitting b, which is equal to the corresponding quantity for 
the irreducible chain B. By a standard result (see, e.g., Aldous and Fill 0, Chapter 
2, Lemma 9]), this is 

w(k) (E T b + E b T k - E T k ) , 

where E{Tj is the expected passage time between i and j in B. But EoT b — fi by 
definition, while B(b, 0) = 1 forces E b T k - E T k = 1. □ 

We now compute the stationary distribution and higher-order transitions of neck- 
lace chains. Proposition ^, which expresses the higher-order transitions of P n ,m in 
terms of sums of i.i.d. random variables, is our central combinatorial argument. 

Proposition 3. Let P n , m be a necklace chain with n link states and m beads of 
type B. Let n be the stationary distribution of B and let /j = EoT b in B. Then the 
stationary distribution K n ,m of P n . m is 



Proof. The function 



(At + lMfc) 

J n + (u — l)m 
1 



n + (/x — l)m 



if s = s^fc is in a bead, 
otherwise. 



ir(k) if s = s i k is in a bead, 
7r(fe) otherwise 



satisfies J2 S ' Pn,m(s, s')f(s) — f(s'). Why? Non-backbone states are only accessi- 
ble from other states in their own bead, so we are just verifying stationarity of tt for 
B. Backbone states not in beads receive and emit a steady flow of 7r(&). Backbone 
states in beads receive 7r(6) (which in B would come from the state b) from what 
precedes them, whether bead or backbone state. 

To normalize, note that J2 S f( s ) — ( n ~ — ir(b))+mir(b). Because B(b, 0) = 
1, E b T+ = /i + 1 and n(b) = 1/(^+1). □ 



Proposition 4. Let P n , m be a necklace chain with n link states and m beads of 
type B. Let Xi,X2, ... be i.i.d. random variables distributed as the first passage 
time from to b in the bead B, and set Sj — X\ + X2 H h Xj . Then 



(1) 



^ Pr [Smj+R, = t-i + Ri-(n- m)j] 

when s — Si is not in a bead, 
B a (0, k) Pr [Smj+R, =t-a-i + Ri-(n- m)j] 

P n,m( s 0,s) = \ a>0 j>0 

when s — Si^ k is in a bead, i 7^ 0, 
Y, B a (0, k) J2 Pr [Smj+R, =t-a-i + Ri-(n- m)j] + B\0, k) 

a>Q j>0 

when s — sq^- is in a bead at position 0. 
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When s n -i.i is in a bead at position n 
(2) 



P Lm( s n-l,l,s) = { 



when sis not in the bead at position n — 1, 
Y / B a (l,b)P r t - n a l (so,s) + B t (l,k) 



when s = s n _i u is in the bead at position n — 1. 



Proof. In the first case of (0) , j counts the number of times the random walk has 
gone around the entire chain. By time t, the walk has gone through mj + Ri 
beads and (n — m)j + (i — Ri) deterministic steps. Thus it is at s, exactly when 
Smj+R, + (ra - m)j + Ri) = t. 

For the second case of (Q), let Xt be the state occupied at time t, and consider a 
trajectory of length t from sq to s^fc. By the same reasoning as for the first case, 

Pv[x t = Sj and x t _\ is not in the bead at position i\x§ = s ] 

= ^2 Pr [Smj+R(i) = t-i + R(i) - (n - m)j] . 

Let t — a be the time of the last arrival at = s^q from the i — 1-st bead before 
time t. In order to stay in the z-th bead until arriving at s^fc at time t, we must 
avoid the state Sj+i = s^b. 

In the third case, it is possible that the chain never leaves the bead at position 

0. 

For (0), note that when s is not in the bead at position n — 1, then we must pass 
through s n —i t b = s o on the way to s. When the target state is in the same bead, 
there are also paths that stay within a single bead. □ 



3. Proofs of Limit Theorems 

The key to proving Theorem [l], and thus Theorem ^, is Lemma |l| an approxi- 
mation of the sums appearing in Proposition ^. While the terms of this sum are 
simply probabilities of sums of i.i.d. random variables taking on particular values, 
each term is taken from a different distribution. We approximate the large terms 
with a local central limit theorem due to Petrov. The largest terms, which are very 
close to those taken from a single distribution, build a theta function. The rest 
of the terms are negligible — most are handled by the local central limit theorem, 
while a Chernoff-type large deviation bound covers those terms for which only a 
few random variables are added. 

Lemma 1. Let Xi,X%, ... be i.i.d. positive-integer-valued random variables with 
maximal span 1 such that for some a < 1 and no, n > n Q implies Pr[Xj > n] < a" . 

Let n = E [Xi], a 2 = Var [JQ], and S n = X 1 H \- X n . 

Fix c > 0. Let m = m(n), r = r(n) and t — t(n) be positive-integer-valued 
functions such that 1 < m < n, < r < n, and 

, . c(n + (u — l)m) 3 _ , . 
t = t{n) = -i ^ >—±- +0{n). 
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t-(u- l)r \ / 1 
o 



Then, as n — > oo and uniformly in r, 

VPr[S mj+r = f + r-(n-m)j] = — 1 t 

n + (p — l)m \n + [fi — l)m J \n 

Proof of Lemma []. First we use the given bound on the tail of the distribution of 
the XiS to build a simple Chernoff-style large deviation bound (see Chernoff [|| 
or Chapter 2 of Janson, Luczak, and Rucihski ||). We can couple X\ to a shifted 
geometric random variable Y such that X\ < Y always and 



Pr[r< > n + n ] 

But then, for any a, 



if n > 0, 
otherwise. 



Pr[S„ = an] < Pr[e* 5 " > e tan ] < ( inf e-* a E [e tXl ] J < (inf e~* a E [e' F ] 
(1 - a) (a - n ) ,l ° / a(a - n + 1) 



(a — n + l) n o _1 \ a—n 

The first fraction inside the exponential above is a rational function of a; the second 
decreases exponentially for a sufficiently large. Thus, there must exist a Co > [i and 
a k > such that, for a > cq, 

(3) Pr [5„ = an] < e ' kan . 

Because t + r — (n — m)j > cq {mj + r) is equivalent to j < n+{c° } -i)m ' inequality (||) 
now implies 

I t-(c -l)r | | t-(cp-l)r | 

Ln+(e -l)mJ L»+(<=0-l)"-J 

(4) £ Pr[5 roi+r =t + r-(n-m)j]< ]T e -*(M»-™)i) 
i=o i=o 

< 



*-(o>-lM e _ fc ( t -(„- m) ( -^)) 
n + (co — l)m / 

e - fcc o( „ t + ( r - ) : ) = ^ e -n ( „) = 



= 

The given tail bound implies that the X^s have moments of all orders, so we 
may now apply a local central limit theorem for lattice random variables due to 
Petrov [ pr] , taking only one term in the asymptotic expansion and specializing to 
the lattice of integers: as N — ► oo, 

n — 1 1 v 

(5) sup 1 



a - fiN 



Pr [S N = a\ ■ ' - 



o-VN \ aVN J J \y/N 



Because the underlying random variables, JQ, are positive integers, all terms of our 
sum for which mj + r > t + r — (n — or equivalently j > — , are zero. Given 
the estimate of equation (|]) , we can restrict our attention to those terms for which 
t ~ (c °~\ )r <j<±. On this range, 

ti-\-{cq— l)m J — n 05 

(6) mj + r = Q (n 2 ) . 

Let 
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and 

t + r — (n — m)j — fJ.(mj + r) (n + (p— l)m)(J — xq) 



Because 



Vi = 



t-(cg-l)r 
n+ 



a^/mj + r a^J mj + r 



<j <i implies y s = 6(1) (j - x ), 
^ 1 + \Vi\ 

„ +(co _ 1)m <J<7I 

Combining (||), @, and (Q) now yields 

^ Pi" [S^j+r = t + r - (n - m)j] 



n+(c -l)r. 



t- eo-l)r ^ t V 
»+(c -l)m n 

We first consider the terms of the resulting sum for which \j — xq\ < n 1 / 4 . On this 
range, 

m 1 



/ C (n+( M -l)m)' nr + to(j ) 

V cH n+(/j-— l)m ^ J u ' 



V5(n + Q* l)m)yi + 0(^) v^(« + (M - 1)-) 



,7/4 



and 

/ n x (n + (//- l)m)(i-x ) i-zo i-x / 1 

(9) % = / ■ , = / ; = = 7=~ + ~7= 

CT ^ + r ^c + Ol^) ^ Vv^ 
-By @, @ and the boundedness of the derivative of (f>, 

2 aVmi + / (%) = S (^(n + (,i - l)m) + ° 

1 ±fj-XQ 



Vc(n + (/x - l)m) ^ V v^ / \™ 5/4 

#c (xo) , / 1 
o \ — 



n + (/i — l)m 

The remaining terms are those for which t ~^ Ca ~}\ r < 7 < in-n 1 / 4 or xn+«. 1/ ' 4 < 

ft n+(co — l)m u 

j < Since there are only O (n 2 ) such terms, each of which is O (^(n 1 ' 4 )), their 
sum is certainly o (— J . □ 

To complete the proof of Theorem [I] we need only check that transition probabil- 
ity factors weighting the inner sums in Proposition |] give the correct the stationary 
distribution factors in the final approximation. 
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Proof of Theorem ^|. First, note that Proposition [l] legitimates applying Lemma [l] 
to sums arising from Proposition ^. 

When s = sq and s' = Sj, where Si is not in a bead, applying Lemma [l] to the sum 
given by the first case of Proposition ||(|l]) and recalling Proposition || immediately 
yields the desired 

ot / ^ , ™ ft-i-(fi-l)Ri\ (I 
\ n + (/t — ljm / \n 

Next, we consider s' = s^fe, a state in a bead at position i. Here, the second case 
of Proposition ||(|l]) applies. First we apply Lemma|l]to each term; by Proposition || 
we can collect the error terms. Then we use the upper bound on B a (0, k) implied 
by Proposition [I] and the boundedness of both 9 C and its derivative to truncate 
the sum, approximate the theta function factor in the remaining terms by its value 
when a — 0, and de-truncate, all with small enough error. Finally, Proposition |^ 
and Proposition [| evaluate the remaining sum. 

P n,m(S0, S i)k ) = > —, TT— 9c —, 7T + O - 

^^n+(/i — ljm \ n+{[x—l)m J \n 



^ o n+(^-l)m\ c \ n+(n-l)m J VvW/ \n 
E a >oB a (0,k) Q ft-i-Qi-l)^ , /I 



n + (// — l)m \ n + (/i — l)m / \n 
, \ a ft-i- (u- 1)Ph\ (\ 
\ n + (/i — ljm J \n 

Furthermore, Proposition |l| ensures that for large n the extra term in the third case 
of Proposition [|([l]) can be absorbed into the error. 

We must still consider s = s n -\j (where I ^ 6), a state in a bead at position n— 1. 
Now Proposition^^) applies. This time, the weights sum as J2 a >o B a (l,b) = 1, 
because the chain B started at I will eventually hit the absorbing state b. Thus 
the error terms can be collected. As above, Proposition [l] covers the extra term in 
the second case of Proposition (|(^|). From this point, we truncate, approximate, 
de-truncate, and sum as above, obtaining 



Pl m { Sn - U ,s')=Y J B a {l,b)Kn(s')e, 
a>0 

\/ri 



t-a-i-(u-l)Ri\ (I 
■ 1 o - 



n + (/i — l)m / \n 



a=0 
a>0 



7r n ( S ') £ B a (l, b)[6 c [ \ \ ^ ) + O 



' -«-(M-iW +0 /MV A 

n+(/z-ljm / \V n JJ \ n 



^ — ' \ n+(/i — ljm / \n 



7T„(S )0 C — ; + O - 

n + (// — ljm / \n 



□ 



We now substitute Theorem [l] into the L 1 expression for the total variation distance 
and collect terms to obtain a Riemann sum. 



A LOCAL LIMIT THEOREM FOR A FAMILY OF NON-REVERSIBLE MARKOV CHAINS 9 



Proof of Theorem [|. Let f(x) = \6 e (x) — 1| and define zq,Z\,..., z n -i € R/Z by 

-t 



n+ (jjl — l)m 

1 + (//- 1)^ 
n + (/i — l)m 



i = 
Ki<n-1. 



The «j's cover the entire circle R/Z with spacing of O (A); the size of the interval 
between z n _i and z$ is determined by the presence or absence of a bead at position 
n — 1. 

Without loss of generality, we may assume that s — sq or s is in a bead at 
position n — 1. Then Theorem |l| implies 



2 ||P*( S , •) - TTnOllyv = £ *0 - *«(«') 



t — i — (p — l)Ri 
n + (/i — l)m 

< - 1 - (At - iygj 

n + (a* — l)m 



+ o(l) -1 
o(l). 



If s' = Si is not in a bead, then 7r„ (s') = — , 1 .,, — = z, — z,_i . If there is a bead 
at position i, then Proposition ^ r,* = 1, and 7r(6) = —j^ imply that 



b-l 



(1-7t(6))(m+1) _ M 



n + (a* — 1)771 77 + (a* — 1)771 



Now group states by position and recall that / is an even function: 

n— 1 - 

2||P*( S! 0-7r„(-)|| w = V(^-^-i)/(^) + o(l)= / f{x)dx + o{\) 

, „ ./R/Z 



2 = 



Remark. Let t/q , yf, y% , ■ ■ ■ be trajectory of P n , and define, for s 6 5, 

/ x * + 0* - - i , • • , , • • ■ 

zt(s) = -. r whenever s = s,- or s is in a bead at position i. 

77 + (At — 1)777 

Then 

^"( T ) = Z \ (n+().-lH 3 T I I ^1 (n+fg-lWOg, 



□ 



is a cadlag random function [0,1] — > R. Using a random-change-of-time argument 
parallel to Billingsley's proof of a functional limit theorem for the renewal process [jjj 
pp. 148-50], it can be shown that the sequence Y 1 ,Y 2 , . . . converges weakly to 
Brownian motion on the circle R/Z. 
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4. Examples 

Fixed number or fixed fraction of beads. When m(n) — m is constant, {P n } 
converges to stationarity on a cubic time scale, 

(n + {fx - l)m) 3 1 . 3 , , 2 , 
2 = ~2^7 l n ) + \ n ) ■ 

Now let m(n) — [kn\, where < k < 1. Then {P„} converges to stationarity on 
the quadratic time scale 

{n + (p - 1) [kn\) 3 _ (kfi-k + 1) 3 , 2 . 

A simple bead I: rearranging beads. Consider the bead B with state space 
{0, 1} and transition matrix (q o )• Then B*(0, 1) = qp 1 ' 1 , so fi = | and er 2 = 
The closure -B has stationary distribution 7r(0) = 7r(l) = Necklace chains 
built with £? are just directed cycles, some of whose states have a hold probability 
of p. When {P n } is a family of such chains, Proposition || implies 



gn+pm(n) 



if s is in a bead, 



— , q , ? otherwise, 



while Theorem || implies {P n } converges to stationarity on a time scale of 

^ 1Q ^ (n + (p- l)m(n) f = (qn + pm(n)) 3 

<7 2 m(n) pqm(n) 

Figure ^|(a) shows the underlying graphs of two chains built with this bead: one 
alternates hold states with forced transitions, the other groups its holds together. 
Since they have the same number of beads, these chains should evolve at the same 
rate. Figure ||(b) shows the higher-order transition probabilities of these two chains 
at identical times (about .08 times the time scale of convergence). Notice the effects 
of the two-valued stationary distributions. Figure ||(c) shows the same weights 
respaced according to the points at which Theorem [j] evaluated 8 C , so that the 
value of Pn(0, s) is attached to ^+(^1)^ ■ tw0 a PP aren t "curves" correspond 
to the two values of the stationary distribution. Figure g(d) normalizes the weight 
at t ~n+^-\)m'' ^ Trj(^i)- The convergence claimed in Theorem [l] is now clear. 

A simple bead II: optimizing convergence. First, let m(n) = n, so that {P n } 
is a family of walks on Z/nZ. Many techniques for analyzing convergence apply, 
including Theorem]^: the time scale of convergence is n 2 /pq, which is fastest when 
p = q= 1/2. 

Now consider a family {P n } with holds at [kn\ states, < k < 1. One might 
expect that p = 1/2 would optimize convergence again, since the variance per coin 
flip is maximized. However, this family converges at time scale 

(qn+p[kn\) 3 = {q + pk) 3 ^ + Q 
pq \kn\ pqk 

The coefficient of n 2 is minimized when 



-k + Vk 2 
P= — i 
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Figure 2. (a) Underlying graphs of two homogeneous stop/go 
chains; n — 50, m = 25, R(i) = [(i + 1)/2J and R(i) = min(i, n/2). 

(b) P*(0, •) for the two chains above; p — 2/3 and t — 530. 

(c) P^(0, •) rearranged, (d) P^(0, •) rearranged and normalized. 



This optimizing probability is a decreasing function of k which approaches 1 as 
k — > and 1/2 as k — * 1 . 

Comparison with other methods. While it is generally difficult to apply the 
usual techniques for bounding rates of convergence to our nearly-periodic examples, 
there is a linked pair of examples which are susceptible. For these, we find that a 
Nash inequality of Diaconis and Saloff-Coste is correct, up to a constant factor, 
while the symmetrized second-largest-eigenvalue bound of Fill l| is off by a log 
factor. 
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2 • 



. : * 

n-l 1 



Figure 3. The underlying graphs of P„ and P£ 



• • • • 

n_1 1 / n-l 1 \ 

2 • 



Figure 4. The underlying graphs of M(P n ) and -?T„ = M{P^~ l ). 



Let 0<p<l,g=l — p, and let 



Pn 











1 



so that P 



n-l 






1 





... 





P 


<1 


... 








p 










'■ q 
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p 



See Figure [|. The chain P n has one bead and n — 1 link states; its bead has \i = 1+q 
and <7 2 = j?g. Theorem || gives a convergence time scale of (n — p) 3 /pq for {P n } ■ 

However, P™ _1 is also a necklace chain, with n link states and n — 1 simple 
beads; the outer cycle has changed direction. By equation (|i"o|), the family {P™ -1 } 
has convergence time scale (n — p) 3 /pq(n— 1) (consistent with the time scale for 

{Pn})- 



Symmetrize and compare. When P is a Markov chain, let P be the time reversal 
of P and M(P) — PP be the multiplicative symmetrization of P. Fill || showed 
that 

(11) ||^o,-) -A-)\\ TV < ~ /, 3i(M(P)Y /2 , 



where fii(M(P)) is the second largest eigenvalue of M(P). 
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1 1 1 q 

n—p n—p ' ' ' n—p n—p 



1 1 



PI 
n—p 



1 1 



4n-4 2n-2 



(x)K n (x,y) 



4n—4 



n—p n—p 



g g g g gg gg 

n(x) • •— — • • Tr(x)K n (x,y) • •— — • • 



n—p 



1 1 



2n-2 4n-4 



00 00 _ 00 00 

tt(x) • • • • w(x)K n (x,y) • • • • 



2n-2 n-1 " ' ' n-1 2n-2 

Figure 5. Stationary distributions and edge weights of K and K. 



4n—4 



Unfortunately, the underlying graph of M(P n ) is almost completely disconnected 
and thus px{M{P n )) = 1 (see @ for several similar examples). However, 



K n = M{P%- 1 ) 



pq p 2 + q 2 pq 



pq p 2 + q 2 



pq 
1-pq 



... pq 

has underlying graph an n-path (see Figure ^J). Because K n has non-trivial edge 
weights, we cannot compute (3i(K n ) directly. Specializing a comparison result of 
Diaconis and Saloff-Coste M yields 



(12) ft(Jf„)<l- (min* n(x) 



TT n (x)K n (x,y) 



x TT n (x)J \ K n (x,y)>0 n n (x)K n (x,y) 



(l-A^n)), 



whenever K n is a chain on the same state space as K n with the property that 
K n {x, y) > implies K n (x, y) > 0; here 7r„ is the stationary distribution of K n . 
It is convenient to take 



1/2 1/2 
1/4 1/2 1/4 



1/4 1/2 1/4 
1/2 1/2 



the "lazy" simple random walk on the n-path. Substituting the stationary distri- 
butions and edge weights of K n and K n (as shown in Figure ||) and (3\{K n ) — 

\ + \ cos fijzr) mto (0) an( i (11) now gives 



pqn 



O 



2(n-l) 2 V("-!) 4 



and 



1/2 
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Since < tt(xq) < T^p' we have shown that O (n 2 logn) steps suffice for P% 1 
to be within a fixed distance of stationarity, while O (n 3 logn) suffice for P n . As 
can happen for bounds using only the second-largest eigenvalue, these results are a 
factor of log n larger than necessary. 

Nash inequalities. Diaconis and Saloff-Coste bound convergence to stationar- 
ity using Nash inequalities. Their results are often sharper than second-largest- 
eigenvalue bounds for slowly-converging chains. Here, we extract only a few pieces 
of their analysis. 

Given an edge set E on a state space S, let d(x, y) be the length of a shortest 
path from x to y via E. Let B{x, r) — {y : d(x, y) < r}. Given a measure it on 5, 
let V(x, r) = Y^yeBlx r) w (y)- Let 7 = m Mi, 9 1/) be the diameter of E. 

Say (S, E, it) has {A, d)-moderate growth if 

V(x,r) > \ f 7 -^-] for < r < 7. 



4 \ 7 

It is easily checked that the underlying graph and stationary distribution of M(P™ _1 ) 
have ( i + ^rr, lj -moderate growth. 

Proposition 5 (Diaconis and Saloff-Coste 0). Let K be a Markov chain on a fi- 
nite set S with stationary distribution it. Let M{K) be the multiplicative sym- 
metrization of K , and E the edge set of the underlying graph of M{K). Assume 
(S, E, 7r) has (A, d)-moderate growth. Let {"f X y} be a collection of paths in E joining 
each pair of states in S, and let 

_ 2 I Tt(z)tt(w) 

a - C^ErH(x)M(K)(x, y )) ^ ™ V{z,r) 

0<r< 7 1*^{x,y) 



d(z,w)<r 



Then 



1 



|P 4 (x ,-) "7r|| Ty < - ai e- m/{al \ for t = a 7 2 + m + l, 



when to > and ai ^ (e(l + d)A) 1 / 2 (4(2 + d)) d > A . 

Following example 5(a) of Q, we take the paths {jxy} in K n — M(P™~ V ) to be 
geodesies and bound a. From the values in Figure |^ we see that 



2 



< — : — t-= and 7r(z)7r(it;) < 



Tr(x)M(K n )(x 1 y) mhi(q 2 1 pq) (n - p) 2 

The number of terms in the sum is at most r(r + l)/2 and |7 ZU1 | < r. Finally, 
f I + ^TT: -moderate growth implies that 

V(z,r)> q ±±^. 

n — p 

Combining these estimates yields 

(l-pq) 2 



a < 



qmin(q 2 ,pq) 

We can conclude that O (n 2 ) steps suffice for the chain P" _1 to be close to sta- 
tionarity, and thus O (n 3 ) suffice for the chain P n . 
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Remark. Although it is difficult to compare the asymptotic statement of Theorem g 
and the direct inequalities of Proposition M, the Nash inequality estimate of the lead 
term constant appears to worsen as q decreases to 0. 

In order to force Pn^^lxn, ■) — n < e using Theorem 0, we need a c such 

TV " 

that 

v 1/2 

\e c (x)-i\dx) <e. 




Then t > c ^ n = — — h O (n) steps suffice. In Proposition ||, we must have 



m > log ^— J 



a7 2 , 



which implies 

4 > f 1 + \ log(48 £ 2 ) + 1 log f 1 -) log e ) (n - If + 1. 

As q — > 0, the lead term is O (-jg log ij n 2 , as opposed to O (j^J n 2 for the asymp- 
totic bound. 
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